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Hansenula polymorpha mutant strains with defect in outer chain biosynthesis and 
the production of recombinant glycoproteins using the same strains 


Field of the invention 

5 

The present invention relates to Hansenula polymorpha mutant strains 
with a defect in the outer chain biosynthesis of glycoproteins and the production 
method of recombinant glycoproteins using these strains. More specifically, the 
present invention relates to the nucleic acid molecules containing H. polymorpha 

10 HpOCHl gene, the polypeptides encoded by it 5 and H. polymorpha artificial 
mutant strains or its natural mutant strains in which hyperglycosylation of 
glycoproteins is prevented. Furthermore, the present invention relates to 
recombinant H, polymorpha strains expressing a foreign protein produced by 
transformation with a gene encoding a foreign protein, and the production method 

15 of a foreign protein, which comprises cultivating the strains under conditions that 
allow them to express the foreign protein and isolating the expressed foreign 
protein from the cultures. 

Background of the invention 

20 

In order to express a foreign protein recombinantly on a large scale, an 

optimal expression system should be selected to establish an efficient production 

system because amounts, solubility, locations and modifications etc. of expressed 
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proteins are dependent on host cell lines or features of desired proteins. For 
large-scale expression of proteins, various host systems including bacteria, yeasts, 
fungi, plants and animals have been developed. Among them, microbes have 
been wildly used to express recombinant proteins because of easy culture thus 
5 getting a high concentration of recombinant protein with a low-cost. 

Yeasts, microbes having features of the eukaryotic expression and 
secretion of proteins, are a suitable expression system to produce recombinant 
proteins of higher eukaryotes on a large-scale. In comparison to bacterial 
expression systems, yeast expression systems have a major advantage in that, as 

10 eukaryotic microbes, they have protein secretory organelles similar to those of 
higher eukaryotes. Therefore, the secretory proteins in yeast become biologically 
active through post-translational modifications such as digestion of secretory 
signal sequences, formation of disulfide bonds, glycosylation etc. Furthermore, 
the expressed recombinant proteins can be easily recovered and purified, since 

15 most yeast cells secrete only a small fraction of the proteins to the outside. 

Recently, methylotrophic yeasts such as Hansenula polymorphs Pichia 
pastoris and other non-conventional yeasts have been developed as alternative 
hosts, because they are able to replace the inherent disadvantages of the 
traditioanl yeast Saccharomyces cerevisiae as hosts for industrial production of 

20 desired proteins. The disadvantages of S. cerevisiae include instability of 
expression vectors in long-term fermentation, hyperglycosylation of 
glycoproteins, and low productivity of the expressed proteins in comparison to 
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bacterial expression system (Gellissen, Appl Microbiol. BiotechnoL 54, 741, 
(2000)). 

Most proteins utilized for medical therapeutic purposes in humans are 
glycoproteins, which are modified by attachment of oligosaccharides via covalent 
bonds in a secretory pathway. An important issue in large-scale protein 
production in the field of biotechnology is the production of recombinant proteins 
modified by suitable glycosylation because the structures and classes of 
carbohydrates attached to the glycoproteins can greatly affect folding, secretion, 
stability, half-life in serum, and antibody inductivity of the proteins. 

Wild type yeasts have some limits as an expression system. The 
recombinant glycoproteins expressed in S. cerevisiae have showed 
hypermannosylation resulting from adding over 40 mannose residues to the 
proteins and a 1,3-linked terminal mannose, which serves as an antigen in the 
human body (Romanos et al, Yeast 8, 423-488, 1992). In contrast the 
recombinant proteins expressed in methylotrophic yeasts, H. polymorpha and P. 
pastor is, have been reported to contain the mannose outer chains that are shorter 
than those expressed in S. cerevisiae although they are still more 
hyperglycosylated than native proteins (Bretthauer and Castellino, BiotechnoL 
Appl Biochem. 30, 193-200, 1999; Kang et al 9 Yeast 14, 371-381, 1998). These 
methylotrophic yeasts are preferred over the wild type S. cerevisiae as a host 
system for medical therapeutic proteins because they do not produce the a 1,3- 
linked terminal mannose, which can evoke an immune response. 
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The core oligosaccharide is an intermediate of the biosynthesis pathway, 
which is found in all eukaryotes from yeasts to mammalian cells. However, the 
outer chains attached to the intermediate are differentially biosynthesized based 
on species of proteins, cells and animals. Researchers have actively pursued the 
5 development of a useful host system to produce recombinant glycoproteins, which 
closely resemble native proteins containing proper outer chains, by means of 
selecting mutant strains with defects in outer chain biosynthesis using an artificial 
mutant method or manipulating the gene related to the chain biosynthesis using 
molecular biological techniques. 

10 

In wild type S. cerevisiae, several strategies such as [ 3 H]mannose suicide 
selection, sodium orthovanadate resistance and hygromycin B sensitivity are used 
to select the defective mutants of N-linked oligosaccharides biosynthesis 
(Herscovics and Orlean, FASEB 7, 540-550, 1999). Functional complementation 

15 experiments using these mutants led to the cloning of the OCH1 gene (Ngd29) 
playing an important role in the outer chain initiation (Nakanishi-Shindo et aL, J. 
Biol Chem. 268, 26338-26345, 1993), the MNN9 gene regulating the outer chain 
elongation and the MNN1 gene involved in attachment of the a 1,3 -linked 
terminal mannose (Gopal and Ballou, Proc. Natl Acad. Sci. USA 84, 8824, 1987). 

20 Those genes were targeted to make defective mutants by mutagenesis, which were 
then developed as a host cell to produce recombinant glycoproteins (Kniskern et 
aU Vaccine 12, 1021-1025, 1994; US Patent no. 5,798,226; US Patent no. 
5,135,854). 


WO 2004/003205 PCT/KR2003/001285 

Methylotrophic yeasts have recently been in the spotlight as a suitable host 
for recombinant protein expression over S. cerevisiae. However, a defective 
mutant of the N-linked oligosaccharide biosynthesis in methylotrophic yeasts has 
not yet been reported. 
5 The goal of this invention was to develop a mutant using H. polymorphs 

a methylotrophic yeast, which can produce recombinant glycoproteins that are 
suitable for use in the human body. This mutant was obtained by selection of a 
defective mutant in the glycosylation pathway or by mutation of the OCH1 gene 
involved in the process. This defective mutant prevents hyperglycosylation of the 
10 outer chains and is a suitable host for recombinant glycoproteins attached with 
proper outer chains, which closely resemble the native proteins. 

Summary of the invention 

15 In order to develop a defective mutant of H. polymorpha for production of 

recombinant N-linked glycoproteins closely resembling those of human, we 
developed a method for selection of a defective mutant of the oligosaccharide 
chain biosynthesis. We used sensitivity of sodium orthovanadate to select a 
defective mutant of H. polymorpha, which exhibits more resistance against it. 

20 We also cloned the OCH1 gene involved in initiation of the outer chain 
biosynthesis. The gene was mutated to make an OCH1 deletion mutant (Aochl) 
strain. This mutant strain is a suitable host, which provides techniques to produce 
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recombinant glycoproteins close to the structure of original proteins with proper 
outer oligosaccharide chains. 

Brief description of the drawings 

5 

Figure 1 shows the difference in the resistance of H. polymorpha strains, 
against sodium ortho vanadate. 

Figure 2 shows the phenotype of the H. polymorpha mutant, DL42-15. 
10 Yeast cells in a log phase were serially diluted 1 to 10 5 5 jjlI was spotted onto the 
YPD plate and the cells were cultured for 2 days. A, YPD media containing 4 mM 
sodium ortho vanadate; B 5 YDP media at 45 °C; C 5 YPD media containing 0.3% 
sodium deoxycholate; D, YDP media at 37 °C. 

15 Figure 3 shows the sequences of DNA and predicted amino acid of H.* 

polymorpha OCH1 gene cloned in this study. 

Figure 4 shows amino acid sequence alignment of the Ochlp of H. 
polymorpha with homologues of other yeast strains. The numbers in parentheses 
20 represent homology of Ochlp from other yeast strains versus Ochlp of K 
polymorpha. HpOchlp; H. polymorpha Ochl protein; ScOchlp, S. cerevisiae 
Ochl protein; ScHoclp, S. cerevisiae Hod protein; CaOchlp, C. albicans Ochl 
protein. 
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Figure 5 is an illustration showing the gene recombination and pop-out to 
induce the H. polymorpha OCH1 gene disruption 

Figure 6 shows the phenotype of the ochl defective mutant {Aochl) of K 
polymorpha. Yeast cells in a log phase were serially diluted 1 to 10, 5 fxl was 
spotted onto the YPD plate and the cells were cultured for 2 days. A, YDP media 
at' 37 °C ; B, YDP media at 45 °C ; C, YPD media containing 40 jag/ml of 
hygromycin B; D, YPD media containing 0.4% of sodium deoxycholate; E, YPD 
media containing 7 mg/ml of calcofluor white. 

Figure 7 is a Western blot demonstrating the changes in the 
oligosaccharide formation of glucose oxidase expressed in the H. polymorpha 
mutant, DL42-15, and the ochl defective strain {Aochl). 

Description of the preferred embodiment 

The present invention consists of selecting the naturally occurring sodium 
vanadate-resistant mutant strain, DL42-15, originated from H. polymorpha DL- 
1 ; cloning the H. polymorpha OCH1 gene and analyzing the DNA sequence; 
disrupting the H. polymorpha OCH1 gene; testing for the glycosylation of the 
Aspergillus niger glucose oxidase protein expressed in the sodium orthovanadate- . 
resistant strain, DL42-15, and the defective mutant strain, Aochl. 
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The invention describes engineering of the defective mutant, which was 
mutated in the outer chain biosynthesis of a methylotrophic yeast H. polymorpha 
to prevent hyperglycosylation by subsequent attachment of mannose residues. 
This mutant is an ideal host for expression of human recombinant proteins 
5 because it produces glycoproteins with fewer outer chains that more closely 
resemble the native proteins and therefore do not initiate an immune response. 
The hyperglycosylation-inhibiting mutants originated from K polymorpha DL-1 
were either a natural mutant selected by sodium orthovanadate or the mutant 
mutated in the OCH1 gene of K polymorpha. 

10 The DNA sequence (nucleotide no. 1) of H. polymorpha OCH1 cloned in 

this study was deposited in GenBank (accession no. AF490971) and in the Korean 
Collection for Type Culture (KCTC) on May 29, 2002 (accession no. KCTC 
10265BP). The sodium orthovanadate-resistant strain, DL42-15, and the OCH1 
gene-mutated strain, Aochl, of K polymorpha were also deposited in the KCTC 

15 on the same day (accession no. KCTC 10263BP and KCTC 10264BP, 
respectively). 

This invention provides the DNA and amino acid sequences shown in 
Figure 3. 

This invention provides the OCH1 gene mutant (Aochl), which inhibits 
20 hyperglycosylation of glycoproteins. 

This invention provides this mutant yeast strain as an expression host to 
express genes encoding heterologous glycoproteins. 
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This invention provides the hyperglycosylation-inhibiting mutant yeast 
strain, DL42-15, deposited in KCTC (accession no. KCTC 10263 BP). 

This invention provides this DL42-15 strain as an expression host to 
express genes encoding heterologous glycoproteins. 
5 This invention provides suitable conditions for cell culture of these 

mutants as well as methods for the production and isolation of the recombinant 
proteins from the culture. 

Methylotrophic yeasts such as H. polymorpha and P. pastoris have been 
extensively used for production of therapeutic recombinant proteins in medical 
10 and pharmaceutical industries. 

The term "hyperglycosylation-inhibiting" used in this study refers to 
reduction of the oligosaccharide chains attached to glycoproteins expressed in the 
mutants of the methylotrophic yeasts in comparison of those of the wild-type 
yeasts. 

15 The term "glycoproteins" used in this study refers to proteins processing 

glycosylation on more than one residue of asparagine, serine or threonine of 
glycoproteins in Hi polymorpha. 

Possible glycoproteins that can be produced using these invented mutants 
include, but are not limited to, the Aspergillus niger glucose oxidase, the S. 

20 cerevisiae invertase, the HIV envelop protein, the influenza A virus 
hemagglutinin, the influenza neuraminidase, the bovine herpes type-1 virus 
glycoprotein D, the human angiostatin, erythropoietin, cytokine, human B7-1, B7- 
2, B-7 receptor CTLA-4, human tissue factors, human growth factors (e.g. blood 
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platelet-derived growth factor), tissue plasminogen activator, plasminogen 
activator inhibitor- 1, eurokinase, human lysosomal enzymes (e.g. a- 
galactosidase), plasminogen, thrombin, factor XIII and immune globulin. Those 
glycoproteins can be used for therapeutic medicine delivered by injection, oral or 
5 non-oral administration or other methods used in particular areas. 

Glycoproteins produced in the mutants can be isolated and purified using 
general methods for protein isolation and purification. However, the specific 
methods employed depend on the property of the proteins to be isolated. These 
properties should be determined by the parties interested. In brief, cultured cells 
10 are collected, the secreted proteins are precipitated, and the proteins are isolated 
and purified according to a general method for protein isolation and purification 
using immune absorption, fractionation or chromatography 

The following examples explain the invention in detail, however, the 
claims are not limited to them. 

15 

Experimental example 1> 

Selection of the sodium orthovanadate-resistant mutant strain, DL42-15, of 
H. polymorpha 

20 

Even a low concentration (5 mM) of sodium orthovanadate generally 

inhibits the growth of yeast. Most S. cerevisiae vanadate-resistantmutant 

strainsare mutants with mutations in genes involved in glycosylation processing 

10 
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in the Golgi (Kanik-Ennulat et ah, Genetics 140; 933-943, 1995); Uccelletti et al. 
Res Microbiol 150:5-12, 1999). One of the most efficient methods for selection 
for oligosaccharide biosynthesis defective mutants is using sodium orthovanadate 
to select one with its resistance and this method has been extensively used in S. 
cerevisiae and Kluyveromyces lactis. However, this method cannot be used in the 
methylotrophic yeast P. pastoris because it itself is resistant to sodium 
orthovanadate (Martinet et al, Biotechnology Lett. 20, 1171-1177, 1999). In the 
case of another methylotrophic yeast H. polymorphs CBS 4732 and NCYC 495 
strains have also been reported that they can grow in the media containing 96 mM 
sodium orthovanadate (Mannazzu et aL, FEMS Microbiol Lett. 147: 23-28, 1997; 
Mannazzu et al Microbiology 144: 2589-2597, 1998). 

The H. polymorpha DL1, used in this study to develop a expression host 
for production of recombinant proteins, showed a similar sensitivity to sodium 
orthovanadate to S. cerevisiae unlike CBS 4732 and NCYC 495 (Figure 1 and 
Table 1). The natural mutant cells of H. polymorpha DL1, which became 
resistant to the sodium orthovanadate , occurred at a frequency of 1 perlO 6 cells 
on the YPD media plate containing 4 mM sodium orthovanadate. showed that 
This mutation frequency is similar to that in the wild type S. cerevisiae (Table 1). 

[Table 1] 

Growth comparison of yeast strains grown on the YPD plates containing sodium 
orthovanadate. 
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Yeast strains 

Sodium orthovanadate (mM) 

4 

6 

8 

10 

12 

S. cerevisiae L3262 (WT) 

± 





S. cerevisiae L3262 (mnn9) 

++++ 

+++ 

+ ■ 



H. polymorpha DL1 (WT) 

± 





H. polymorpha CBS4732 (WT) 

+++++ 

+++++ 

+++++ 

++++ 

+++ 


* The results were obtained after culturing at 30°C (S. cerevisiae)ox at 37°C (H. ' 
polymorpha) for 4 days. 

All the defective mutants of oligosaccharide biosynthesis among the 
5 sodium orthovanadate-resistant mutants of S. cerevisiae have been shown to be 
more sensitive to antibiotics with a large molecular- weight such as 
aminoglycoside, to synthetic detergents such as sodium deoxycholate, and to high 
temperature (Dean N., Proc. Natl Acad. Scu USA 92, 1287-1291, 1995). We 
selected 250 natural mutants from H. polymorpha DL1 showing more resistance 

10 to sodium orthovanadate, most (over 90%) of which were also resistant to 
hygromycin B. The selected mutants have been further tested on the media 
containing sodium orthovanadate at high temperature (45 °G) to select the mutant 
colonies resistant to sodium orthovanadate but sensitive to high temperature. 
Finally, the mutants have been isolated and designated as K polymorpha DL42- 

15 15 (Figure 2 and 3). 

Experimental example 2> 

Cloning and DNA sequence analysis of the K polymorpha OCH1 gene 

12 
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We analyzed the Random Sequenced Tags (RSTs) of the partial genomic 
analysis of K polymorpha (Blandin et ah, FEBS Lett 487, 76, 2000) and 
obtained the partial DNA sequences of genes showing homology with the genes 
involved in the oligosaccharide biosynthesis of S. cerevisiae. The predicted 
amino acid sequences deduced from the partial DNA sequences share homology 
with a region corresponding to the C-termini of S. cerevisiae OCH1 (ScOCHl), 
which plays an important role in attachment of al, 6-mannose in the beginning of 
the outer chain biosynthesis. S. cerevisiae ScOCHl also shares high homology 
to S. cerevisiae HOC1 {ScHOCl). A pair of primers designed based on the partial 
DNA sequences are S'-CAATCAGACCCGGTCTGTCGAGGAGT-S'faucleotide 
no. 3), S'-ACATCAACGTGGAGAACTGGGAGCAC-S' (nucleotide no. 4). Using 
these primers, we amplified by PCR a 900 bp fragment from genomic DNA 
isolated from H. polymorpha. 

We performed Southern blotting, probed with the 900 bp fragment, using 
the genomic DNAs digested with several restriction enzymes. In order to isolate 
the promoter region and full-length of the K polymorpha OCH1 gene, we gel- 
extracted the two fragments of 2.3 kb (digested with BamHI) and 5 kb (digested 
with Bglll) corresponding to the signals of the Southern blot. Each fragment was 
then cloned into a cloning vector pBluescript KS+ (Stratagen Co.). The clones 
were sequenced in both strands. 

The DNA sequence analysis revealed the clones include the promoter 

region of 1 kb and the open reading frame of 1.3 kb encoding a putative protein 

13 
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with 435 amino acids (nucleotide no. 1, Figure 3). The predicted protein of H. 
polymorpha was designated as HpOchl (amino acid sequence no. 2). This protein 
shares low homology (21-23%) to ScOCHl (accession no. YGL038C), ScHOCl 
(accession no. YJR075W) and Candida albicans Ochl (accession no. AY064420) 
proteins. However, it contains a DXD motif, a possible activation site, and the 
transmembrane spanning region in the N-terminal found in the 
mannosyltransferase, a type II membrane protein (Figure 4). 

Experimental example 3> 

Production and analysis of the OCH1 gene-mutated strain ( A ochl) of K 
polymorpha 

In order to make the mutants where the OCH1 gene was disrupted, two 

techniques, fusion PGR using the primers listed in Table 2 and in vivo DNA 

recombination, were used for the gene disruption (Oldenburg et ah, Nucleic Acid 

Res. 25, 451, 1997). The regions corresponding to the N-terminal and the C- 

terminal of URA3 and OCH1 genes, respectively, were amplified by PGR. The 

fragment corresponding to the N-terminal of HpOCHl was then fused by fusion 

PGR to the fragment corresponding to the N-terminal of URA3 while the fragment 

corresponding to the C-terminal of HpOCHl was fused to the fragment 

corresponding to the C-terminal of URA3. The fused DNA fragments were 

introduced into yeast cells to make recombination of the gene. Transformants 

14 
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where the HpOCHl gene was disrupted were then selected (Figure 5). The 
mutants were first screened on the minimal media containing no uracil, selecting 
for the URA3. marker. PGR was then performed on the genomic DNAs isolated 
from the mutants and the wild type to confirm the HpOCHl gene disruption. An 
5 H polymorpha mutant Aochl (Jeu2 ochl::URA3) was selected based on analysis 
of the PGR products. 

The selected mutant strain Aochl grows more slowly than the wild type; it 
is more sensitive to a high temperature of 45 °C and to hygromysin B; its growth 
is inhibited by addition of sodium orthovanadate and calcofluor white (Figure 6). 
10 All these properties are common in the defective mutant strains of the outer chain 
biosynthesis in yeasts, suggesting the mutant strain Aochl has a defect in the 
biosynthesis. 

[Table 2] 

15 1 Primers used in this study for PGR to disrupt the HpDCHl gene 


Name 

Oligonucleotide sequences (5' to 3') 

Nucleotide no. 

OCHl 
N-S 

ACATCAACGTGGAGAACTGG 

5 

OCHl 
N-A 

AGCTCGGTACCCGGGGATCCTGTCTGTCCACAC 
AACAGG 

6 

OCHl 
C-S 

GCACATCCCCCTTTCGCCAGCCCATACACTCCTT 
ACTAGG 

7 

OCHl 
C-A 

CAATCAGACCCGGTCTGTCGAGGAGT 

8 

URA3 
N-S 

GGATCCCCGGGTACCGAGCT 

9 

URA3 
N-A 

CACCGGTAGCTAATGATCCC 

10 

URA3 

CGAACATCCAAGTGGGCCGA 

11 
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c-s 



URA3 
C-A 

CTGGCGAAAGGGGGATGTGC 

12 


Experimental example 4> 

Analysis of the recombinant glycoproteins expressed and isolated from the mutant 
5 strains, DL42-15 and Aochl, of K polymorpha. 

In order to examine the glycosylation defect on a recombinant 
glycoprotein expressed in the mutant strains, DL42-15 and Aochl, described in 
experimental example 1 and 3 respectively, we expressed the glucose oxidase 

10 (GOD) of an Aspergillus niger gycoprotein in these mutants. The GOD protein 
contains the 8 potential sites for the N-linked glycosylation (Frederick et al^ X 
Biol Chem. 265, 3793, 1990). 

In order to express the GOD in the mutant yeast strains, we constructed a 
GOD expression vector, pDLMOX-GOD using the pDLMOX-Hir vector (Kang et 

15 ah Yeast 14, 371, 1998)). The DNA fragment containing the hirudin gene was 
first removed from the pDLMOX-Hir vector and the GOD gene fused to the 
fragment corresponding to the secretory signal of the a-amylase at the N-terminal 
was then replaced in the vector (Kim S. Y. Ph. D. Dissertations, Yonsei 
University, Korea, 2001). The resultant vector pDLMOX-GOD was introduced 

20 into the two mutant strains, DL42-15 and Aochl as well as the wild type strain, 

16 
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and they were cultured on the YPM media (1% yeast extract, 2% peptone, 2% 
methanol) to express the GOD proteins. 

The GOD proteins expressed and secreted were isolated and purified for 
Western blot anlaysis. The proteins were run on a polyarcylamide gel, transferred 
5 to a nitrocellulose membrane, and blotted using a GOD antibody. Figure 7A 
shows that the GOD proteins of the mutant strains, DL42-15 and Aochl, have a 
smaller molecular weight than that of the wild type, suggesting the proteins 
expressed and secreted in the mutants are less hyperglycosylated, or in other 
words, hyperglycosylation is inhibited in the mutant strains. To confirm the 

10 blotting result, we treated all the proteins with endoglycosidase H enzyme to 
digest the oligosaccharide chains attached on the proteins, and repeated the blot. 
Figure 7B shows that all the proteins have the same molecular weight on the blot, 
suggesting they are all the same proteins. These results demonstrate that the 
proteins expressed and secreted in the mutant cells were smaller than the one 

15 expressed and secreted in wild type cells due to less hyperglycosylation on the 
proteins. Therefore, the mutant strains, DL42-15 and Aochl, unlike the wild type, 
are suitable host cells to produce the human glycoproteins, in which the 
hypergylcosylation of the proteins will be inhibited, resulting in a closer 
resemblance to native human proteins. 

20 

Possible application of the invention to industries 
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The H. polymorpha mutants, DL42-15 and Aochl, are able to be used as 
host cells to produce recombinant glycoproteins, which will express and secrete 
the proteins containing proper outer oligosaccharide chains closely resembling 
the native proteins because the hyperglycosylation of the proteins is inhibited in 
5 the mutants cells. These mutants will be useful in the medical therapeutic industry 
because K polymorpha yeast cells has been broadly used to produce medical 
therapeutic recombinant proteins on a large scale. 
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File reference PCTA/KRIB/2 


International application No. 

PCT/KR03/01285 
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What Is Claimed Is: 

1. A nucleic acid molecule comprising the DNA sequence shown in Figure 3. 

5 2. The nucleic acid molecule according to claim 1, wherein the nucleic acid 
molecule is Hansenula polymorpha HpOCHl gene (KCTC 10265BP). 

3. A polypeptide comprising the amino acid sequence shown in Figure 3. 

10 4. K polymorpha mutant strains which prevent the hyperglycosylation of 
glycoproteins by mutation of the HpOCHl gene. 

5. A H polymorpha mutant strain Aochl (KCTC 10264BP) according to claim 4, 
wherein the HpOCHl gene is disrupted. . 

15 

6. A recombinant H polymorpha strain expressing a foreign protein, wherein the 
recombinant strain is produced by introducing the gene encoding the foreign 
protein into H. polymorpha strain according to claim 4. 

20 7. A recombinant K polymorpha strain according to claim 6 expressing a foreign 
protein, wherein the recombinant strain is produced by introducing the gene 
encoding the foreign protein into the disrupted mutant strain, Aochl . 

22 
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8. A natural mutant strain, DL-42-15, originated from H. polymorpha DL1 strain, 
prevented from hyperglycosylation of glycoproteins (KCTC 10263BP). 


9. A recombinant K polymorpha strain expressing a foreign protein, wherein the 
recombinant strain is produced by introducing the gene encoding the foreign 
protein into H. polymorpha mutant strain, DL-42-15 according to claim 8 

10. A method for producing a foreign protein, wherein the method comprises 
cultivating the recombinant strains according to any one of claims 6, 7, and 9 
under conditions that allow the strains to express the foreign protein and isolating 
the expressed foreign protein from the cultures. 

11. The method according to claim 10, wherein hyperglycosylation of a foreign 
protein is prevented. 
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Sequence Listing 


<1L0> Korea Research Institute of Bioscience and Biotechnology 

<120> Hansenula polymorpha mutant strains with defect in outer chain 
biosynthesis and the production of recombinant glycoproteins 
using the same strains 

<1S0> 12 

<170> Kopatentln 1.71 

<210> 1 

<211^» 2462 

<212> DNA 

<213> Hansenula polymorpha 
<220> 

<221> sigjpeptide 

<222> (850) . . (852) 

<223> initiation codon 

<220> 

<221> 5'UTR 

<222> (1) . . (849) 

<220> 

<221> 3'UTR 

<222> (2158) . . (2462) 

<220> 

<221> terminator 

<222> (21SS) . . (2157) 

<223> termination codon 

<220> 

<221> CDS 

<222> (350) . . {2154 ) 

<223> coding sequence 
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Sequence Listing 

<400> 1 

gtcctttgga atattctcct ccggcagcca gcctcctttg gaggtcaatc caggacacat so 

aatattggct cccgagagca caaatttgat cgctcccctg tcgacctgga ctctagggaa 12 0 

cagagtagga tatttgtgca cgagcttgag tgatggcagc aattcgtcga atttctggta 180 

aaacagcacc tctccatcga ctaagtagag ctggattttg tcctcgcatc tatagtgctg 24 0 

aacggtggac ttcttgggca cgagctcgtc gatgatgggc tcaaagccgg gatattgctc 300 

cacaaacctc gttttgaggc cacgctgcac ggaagacttg acgtttgacc gggtgtggac 360 

gtcttcttta gagaatttct tgaacataag ggaaataaga agctagacaa gtagatgaaa 420 

aaaaacacat atttcgacgc ttagtccatg cgtcgatcga cgtgaaacat gagttaaggg 480 

gtaacgccca tttaggcgga aaatgagcct gatggattga taatatgtat caggagttgt 540 

tgaagtgtct attatcgatt cggcgcacaa aatccgctca ttttgcaatg ttccgacatt 600 

cttaattaac acttcgcgtc cgccttagca cacgaaaaga tctacagaac gaaaaaaaaa 660 

atagacaact aaacagaaag cttttttata aactcggcta cgatcagctt tttgtaatac 720 

atttttgctt tggattgcta cgagaacatt caaaactgca ggaaagattc gcagtatctt 78 0 

tctctttttt acaagatccg agtttgccta atattcgact gaaactcgcc gccacattct 84 0 

cttagtgat atg age aaa gca teg cct tac aga ggc ate aac teg aca teg 891 
Met Ser Lys Ala Ser Pro Tyr Arg Gly He Asn Ser Thr Ser 
15 10 

teg acg tct cca aag ttt aaa aaa ctg tec att ttt gtg ggg ttg ctg 93 9 
Ser Thr Ser Pro Lys Phe Lys Lys Leu Ser lie Phe Val Gly Leu Leu 
15 20 25 30 

ctg ggc ctg act ctg etc aaa etc tec ace age tgg tct ace aac acg 98 7 
Leu Gly Leu He Leu Phe Lys Phe Ser Thr Ser Trp Ser He Asn Thr 
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Sequence Listing 


35 40 45 

gag gac aag ate gtt teg gaa tat etc aac aac ttc tac aag eta aat 1035 
Glut Asp Lys lie Val Ser Glu Tyr Leu Asn Asn Phe Tyr Lys Leu Asn 
50 55 SO 

cca aaa ttc cgc ggt gec aac ccg tac gac gcg gca gtc act gca gag 1083 
Pro Lys Phe Arg Gly Ala Asn Pro Tyr Asp Ala Ala Val Thr Ala Glu 
65 70 75 

aga ctg gec aag ttc ttc cca tat gac aac agt gec aga aga ate gag 1131 
Arg Leu Ala Lys Phe Phe Pro Tyr Asp Asn Ser Ala Arg Arg Xle Glu 
80 85 90 

aag age ate tgg cag atg tgg aag gtg cct tec acc gac cca gac etc 1179 
Lys Ser lie Trp Gin Met Trp Lys Val Pro Ser Thr Asp Pro Asp Phe 
95 100. 105 no 

cct cac aag gag eta gtg aac aag tgg aaa aat gag aac cca acc tac 1227 
Pro His Lys Glu Leu Val Asn Lys Trp Lys Asn Glu Asn Pro Thr Tyr 
115 120 12S 

aaa tac aac ctg ctg act gac gac gag ate tta gag att ttg aga ate 1275 
Lys Tyr Asn Leu Leu Thr Asp Asp Glu lie Leu Glu He Leu Arg lie 
130 135 140 

egg ttc aaa gac acc gtt cct gag gtg etc gag gcg ttc gag atg ttg 1323 
Arg Phe Lys Asp Thr Val Pro Glu Val Leu Glu Ala Phe Glu Met Leu 
145 ISO 155 

cca aac aaa ate ate egg tec gac ttc get aga tac ctg ctg att ttc 1371 
Pro Asn Lys He He Arg Ser Asp Phe Ala Arg Tyr Leu Leu He Phe 
ISO 165 170 

ctg aac ggc ggt gtc tac gca gac ate gac aca gac ctg caa aag cca 1419 
Leu Asn Gly Gly Val Tyr Ala Asp He Asp Thr Asp Leu Gin Lys Pro 
175 130 IBS 190 

gtc gac acg tgg tec gac tct gac aga aat gtg gga cct gcg gtc gee 14 67 

Val Asp Thr Trp Phe Asp Ser Asp Arg Asn Val Gly Phe Val Val Ala 
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Sequence Listing 


195 200 205 

gtc gag gag gac ate aac gtg gag aac tgg gag cac tac atg acc aga 1515 
Val Glu Glu Asp He Asn Val Glu Asn Trp Glu His Tyr Met Thr Arg 
210 215 220 


aga ate cag ttt gag cag tgg aca ttc aag gec aag gca aaa cat cct 1563 

Arg He Gin Phe Glu Gin Trp Thr Phe Lys Ala Lys Ala Lys His Pro 
225 230 235 

att ttg aga aag ctg att gca aag ate gtc gaa acc act ttc cag gec 1611 

He Leu Arg Lys Leu He Ala Lys He Val Glu Thr Thr Phe Gin Ala ' 
240 245 250 


aag aag aac gac aaa ctg cag get tac tac aaa gat ttc aaa ggc gtc 165 9 

Lys Lys Asn Asp Lys Leu Gin Ala Tyr Tyr Lys Asp Phe Lys Gly Val 
255 ' 260 265 270 

gat aga tgt get tec gtg gat ate atg gtg tgg acc ggt cct gtt gtg 1707 
Asp Arg Cys Ala Ser Val Asp He Met Val Trp Thr Gly Pro Val Val 
275 280 285 


tgg aca gac act ate tat gcg cac ctg aac teg ate cca age cca acg L755 
Trp Thr Asp Thr He Tyr Ala His Leu Asn Ser He Pro Ser Pro Thr 
290 295 300 


att gtc gac ata gac cac caa aga gac att gcg gga gag etc tat ggc 18 03 

He Val Asp He Asp His Gin Arg Asp He Ala Gly Glu Leu Tyr Gly 
305 310 315 


cct gag acc gga gaa gga gac gtg att teg tgg aga ttc ttt get ggt 1851 
Pro Glu Thr Gly Glu Gly Asp Val He Ser Trp Arg Phe Phe Ala Gly 
320 325 330 


tta aga get cct gtg atg ate gac gac gtg gtc att tat cca agg gec 1899 
Leu Arg Ala Pro Val Met He Asp Asp Val Val He Tyr Pro Arg Ala 
335 340 345 350 


tec ttc aga gag gac aag gag aac aat tgc gga aaa tac tgc tac get 1947 
Ser Phe Arg Glu Asp Lys Glu Asn Asn Cys Gly Lys Tyr Cys Tyr Val 
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355 360 365 

cac cac cac ttt gga gga tec tgg aaa aac aac gga aag ggc gag ate 1995 
His His His Phe Gly Gly Ser Trp Lys Asn Asn Gly Lys Gly Glu He 
370 375 380 

aag cca ggc atg gag gga tac gaa gga gag gat cca aac gaa gtg gaa 2 04 3 

Lys Pro Gly Met Glu Gly Tyr Glu Gly Glu Asp Pro Asn Glu Val Glu 
385 390 395 

gag etc aga aag aac gat gtc age aag agg gac gtt att cct cat gag 2091 
Glu Leu Arg Lys Asn Asp Val Ser Lys Arg Asp Val lie Pro Gly Glu 
400 405 410 

tct aaa gac gtt get ccg gtg aag aag ctt gec aag aga tgt gcg tac 213 9 

Ser Lys Asp Val Ala Pro Val Lys Lys Leu Ala Lys Arg Cys Ala Tyr 
43 -5 420 425 430 

cca tac act cct tac taggta agtagtcgaa cattctaata agatggatag 2190 

Pro Tyr Thr Pro Tyr 
435 

tgtatttatt atgtaagctg gaaaaaaaac attttttaga gecaaaaaac caaaaagttt 2250 

tccatcaacg cacatgtaca gtctaattct taggeggctg aactcgacag ctgcgctgcc 2310 

ggccgtgtcg aaaagacagc tgatctcgcc gatggctgtt ccagccctca aaagtgtggc 2370 

cccgctcccc ttcaagagee cagaggecaa aagactgege aaaagactcc tcgacagacc 243 0 

gggtctgatt cgcgtcaagc geggcatgae gc 2 4 62 


<210> 2 
<211> 435 
<212> PRT 

<213> Hansenula polymorpha 
<400> 2 

Met Ser Lys Ala Ser Pro Tyr Arg Gly He Asn Ser Thr Ser Ser Thr 
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15 10 15 

Ser Pro L»ys Phe Lys Lys Leu Ser He Phe Val Gly Leu Leu Leu Gly 
20 25 30 

Leu He Leu Phe Lys Phe Ser Thr Ser Trp Ser lie Asn Thr Glu Asp 
35 40 45 

Lys He Val Ser Glu Tyr Leu Asn Asn Phe Tyr Lys Leu Asn Pro Lys 
50 55 60 

Phe Arg Gly Ala Asn Pro Tyr Asp Ala Ala Val Thr Ala Glu Arg Leu 
65 70 75 80 

Ala Lys Phe Phe Pro Tyr Asp Asn Ser Ala Arg Arg He Glu Lys Ser 
85 90 95 

He Trp Gin Met Trp Lys Val Pro Ser Thr Asp Pro Asp Phe Pro His 
100 105 110 

Lys Glu Leu Val Asn Lys Trp Lys Asn Glu Asn Pro Thr Tyr Lys Tyr 
115 120 125 

Asn Leu Leu Thr Asp Asp Glu He Leu Glu He Leu Arg He Arg Phe 
130 135 140 

Lys Asp Thr Val Pro Glu Val Leu Glu Ala Phe Glu Met Leu Pro Asn 
145 150 155 160 

Lys He He Arg Ser Asp Phe Ala Arg Tyr Leu Leu He Phe Leu Asn 
165 170 175 

Gly Gly Val Tyr Ala Asp He Asp Thr Asp Leu Gin Lys Pro Val Asp 
190 185 190 

Thr Trp Phe Asp Ser Asp Arg Asn Val Gly Phe Val Val Ala Val Glu 
195 200 -05 

Glu Asp He Asn Val Glu Asn Trp Glu His Tyr Met Thr Arg Arg He 
210 215 220 
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Gin Phe Giu Gin Trp Thr Phe Lys Ala Lys Ala Lys His Pro lie Leu 
225 230 23S 240 

Arg Lys Leu lie Ala Lys lie Val Glu Thr Thr Phe Gin Ala Lys Lys 
245 250 255 

Asn Asp Lys Leu Gin Ala Tyr Tyr Lys Asp Phe Lys Gly Val Asp Arg 
260 265 270 

Cys Ala Ser Val Asp He Met Val Trp Thr Gly Pro Val Val Trp Thr 
275 280 285 

Asp Thr He Tyr Ala His Leu Asn Ser lie Pro Ser Pro Thr He Val 
290 295 300 


Asp He Asp His Gin Arg Asp He Ala Gly Glu Leu Tyr Gly Pro Glu 
305 310 315 320 

Thr Gly Glu Gly Asp Val He Ser Trp Arg Phe Phe Ala Gly Leu Arg 
325 330 335 

Ala Pro Val Met He Asp Asp Val Val He Tyr Pro Arg Ala Ser Phe 
34Q 345 350 


Arg Glu Asp Lys Glu Asn Asn Cys Gly Lys Tyr Cys Tyr Val His His 
355 360 365 

His Phe Gly Gly Ser Trp Lys Asn Asn Gly Lys Gly Glu He Lys Pro 
370 375 380 

Gly Met Glu Gly Tyr Glu Gly Glu Asp Pro Asn Glu Val Glu Glu Leu 
385 390 395 400 

Arg Lys Asn Asp Val Ser Lys Arg Asp Val He Pro Gly Glu Ser Lys 
40S 410 415 

Asp Val Ala Pro Val Lys Lys Leu Ala Lys Arg Cys Ala Tyr ?ro Tyr 
420 425 430 
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Thr Pro Tyr 
435 


<2L0> 3 

<211> 26 

<2L2> DNA 

<2L3> Artificial Sequence 
<220> 

<:223> primer 

<400> 3 

caatcagacc cggtctgtcg aggagt 2 6 

<210> 4 

<=211> 26 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 

<400> 4 

acatcaacgt ggagaactgg gagcac 2 6 

<210> 5 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 

<400> 5 
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Sequence Listing 


acatcaacgt ggagaactgg 20 


<210> 6 

<211> 39 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 


<400> 6 

agctcggtac ccggggatcc tgtctgtcca cacaacagg 39 


<210> 7 

<211> 40 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> oriraer 


<400> 7 

gcacatcccc ctttcgccag cccatacact ccttactagg 40 


<210> 8 

<211> 26 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 


<400> 3 

caatcagacc cggtccgtcg aggagc 
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<210> 9 

<211> 20 

<212> DMA 

<213> Artificial Sequence 
<220> 

<223> primer 


<400> 9 

ggatccccgg gtaccgagct 20 


<2L0> 10 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 


<400> 10 

caccggtagc taatgatccc 20 


<210> 11 

<211> 20 

<21"2> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 


<400> 11 

cgaacatcca agtgcgccga -° 
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<210> 12 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 


<400> 12 

ctggcgaaag ggggatgtgc 20 
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